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[Name of the Document] Specification 

[Title of the Invention] INQUIRY REPLY ANALYSIS SYSTEM 
[Scope of Claim for a Patent] 

[Claim l] An inquiry reply analysis system comprising^ 
5 means for entering an inquiry reply statement including free reply 

description in natural language, 

a network for transmitting the inquiry reply statement, 

a database for accumulating the transmitted inquiry reply statements, 

and 

10 a text classification engine for reading out the inquiry reply statements 
from the database and learning a rule for classifying the inquiry reply 
statements. 

[Claim 2] An inquiry reply analysis system comprising^ 

means for entering an inquiry reply statement including free reply 
15 description in natural language, 

a database for accumulating the transmitted inquiry reply statements, 

and 

a text classification engine for reading out the inquiry reply statements 
from the database and learning a rule for classifying the inquiry reply 
20 statements. 

[Claim 3] An inquiry reply analysis system comprising: 

means for entering an inquiry reply statement including free reply 
description in natural language, 

a network for transmitting the inquiry reply statement, 
25 a database for accumulating the transmitted inquiry reply statements, 

a text classification engine for reading out the inquiry reply statements 
from the database and learning a rule for classifying the inquiry reply 
statements, and 

means for distributing the rule through the network according to a 

1 



request from a claimant. 

[Claim 4] An inquiry reply analysis system according to claim 1, 2, or 3, 

wherein said text classification engine includes morpheme analysis means 
for analyzing morphemes in all sentences in the inquiry reply statements 
5 accumulated in the database, category-text designating means for 
designating the category and text, attribute selecting means for selecting 
attributes in plural inquiry reply statements being read in from the 
database, rule learning means for learning the rule for expressing the 
correspondence of the text and category on the basis of the words selected by 

10 attributes by the attribute selecting means, and rule output means for 
issuing the rule learned by the rule learning means. 
[Claim 5] An inquiry reply analysis system according to claim 4, 

wherein said attribute selecting means executes a procedure of computing 
the difference ASCCco) between the stochastic complexity of a test set without 

15 consideration of appearance of word and the stochastic complexity of a text 
set with consideration thereof, in each word appearing in the text, and a 
procedure of selecting as attribute when the difference ASC((o) is lager than 
the threshold x. 

[Claim 6] An inquiry reply analysis system according to claim 4, 
20 wherein said rule learning means executes a procedure of forming the text 
set by replacing with an expression of (di, ci), (d2, 02), (dm, Cm) [where each 
di is a multi-valued discrete vector di = (coii, coi2, coin) (i = 1, m), coij is 1 
when word obtained by attribute selection coj (j = 1, n) appears in the i-th 
text, or 0 otherwise, Ci expresses the value (label) of the category according 
25 to the i-th text and each Ci is 1 when belonging to a specific category, or 0 
otherwise, and m is the number of texts], a procedure of growth process by 
selecting the rules of if-then-else format and sequentially adding to the 
stochastic decision hst by employing the information quantity standard such 
as the extended stochastic complexity minimum principle or stochastic 



complexity minimizing principle, and a procedure of removing the rules one 
by one from the last one of the stochastic decision list, and clipping 
continuously until none should be removed from the viewpoint of the 
extended stochastic complexity minimum principle. 
5 [Claim 7] A recording medium recording a program for making a computer 
function as morpheme analysis means for analyzing morphemes in all 
sentences in the inquiry reply statements accumulated in the database, 
category-text designating means for designating the category and text in the 
text classification engine, attribute selecting means for selecting attributes 

10 in plural inquiry reply statements being read in from the database, rule 
learning means for learning the rule for expressing the correspondence of 
the text and category on the basis of the words selected by attributes by the 
attribute selecting means, and rule output means for issuing the rule 
learned by the rule learning means. 

15 [Detailed Description of the Invention] 
[0001] 

[Technical Field of the Invention] 

The present invention relates to an inquiry reply analysis system, and 
particularly to an inquiry reply analysis system using a text automatic 
20 classification technology, a natural language processing technology, and a 
network utilization technology. 
[0002] 
[Prior Art] 

The operation for extracting general features and tendency from inquiry 
25 reply statements including free reply description in natural language 
obtained through the network such as the Internet has been conventionally 
done almost manually. Text mining tools such as DE-FACTO developed by 
Dentsu (published in leaflet), Keyword Associator of Fujitsu (L Watanabe: 
Divergent thought support system "Keyword Associator" 2nd edition, 
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research group paper of 15th Meeting of System Engineering Group of 
Society of Measurement and Automatic Control of Japan, July 1994), and 
"HIPS" (Watanabe, Miki, Nitta, Sugiyama^ Hybrid thought support system 
HIPS, research group paper of 15th Meeting of System Engineering Group 
5 of Society of Measurement and Automatic Control of Japan, January 1995) 
were used for extracting the relationship of words from the text information. 
However, these tools could not express the features of inquiry reply 
statements in a format of rule. 
[0003] 

10 So far, nothing has been known about the system or service for collecting 
and analyzing inquiry reply statements including free reply description in 
natural language automatically through the network such as the Internet, 
and distributing the analysis results, if necessary, to the claimant. For 
example, in JP H11-066036A or JP H11-143856A, the technology for 

15 inquiring through the network and accumulating the replies in the database 
is disclosed, but features of inquiry reply statements are not extracted in a 
format of rule. 
[0004] 

[Problems that the Invention is to Solve] 
20 In the conventional manual inquiry reply analysis mentioned above, if the 
number of inquiry replies is tremendous, the operation is too inefficient. 
[0005] 

In text mining tools such as DE-FACTO and HIPS, features of inquiry 
replies cannot be extracted in a format of rule, it was not sufficient from the 
25 viewpoint of presentation of compact and easy knowledge. 
[0006] 

The text classification tool used in retrieval is a useful tool for analysis of 
inquiry replies, but it was not used before in analysis of inquiry replies 
including free reply description in natural language. 



[0007] 

It is a first object of the invention to present an inquiry reply analysis 
system capable of automatically presenting knowledge in a compact and 
easy rule from inquiry reply statements including free reply description in 
5 natural language by using a text classification engine. 
[0008] 

It is a second object of the invention, in addition to the above, to present 
an inquiry reply analysis system for distributing analysis results to the 
claimant by automatically extracting the knowledge in the rule format from 
10 the inquiry reply statements collected through the network. 
[0009] 

[Means for Solving the Problems] 

An inquiry reply analysis system of the invention comprises means for 
entering an inquiry reply statement including free reply description in 
15 natural language, a network for transmitting the inquiry reply statement, a 
database for accumulating the transmitted inquiry reply statements, and a 
text classification engine for reading out the inquiry reply statements from 
the database and learning a rule for classifying the inquiry reply statements. 
[0010] 

20 Further, an inquiry reply analysis system of the invention comprises 
means for entering an inquiry reply statement including free reply 
description in natural language, a database for accumulating the 
transmitted inquiry reply statements, and a text classification engine for 
reading out the inquiry reply statements from the database and learning a 

25 rule for classifying the inquiry reply statements. 
[0011] 

Moreover, an inquiry reply analysis system of the invention comprises 
means for entering an inquiry reply statement including free reply 
description in natural language, a network for transmitting the inquiry 



reply statement, a database for accumulating the transmitted inquiry reply 
statements, a text classification engine for reading out the inquiry reply 
statements from the database and learning a rule for classifying the inquiry 
reply statements, and means for distributing the rule through the network 
5 according to a request from a claimant. 
[0012] 

[Mode for Carrying Out the Invention] 

Embodiments of the invention are described in detail below while 

referring to the accompanying drawings. 
10 [0013] 

(1) First Embodiment 

Fig. 1 is a block diagram showing a configuration of an inquiry reply 

analysis system according to a first embodiment of the invention. The 

inquiry reply analysis system of the embodiment mainly comprises 
15 respondent computers 111 to UN (N being a positive integer), a network 12, 

a database 13, and a text classification engine 14. 

[0014] 

The respondent computers 111 to UN are computers, portable 
information terminals, cellular phones, and other devices having 
20 transmission function of message, mail and the like, which are connected to 
the network 12. 
[0015] 

The network 12 includes various networks, whether wired or wireless, 
such as public networks, exclusive networks, or LAN (local area network). 
25 [0016] 

The database 13 is connected to the network 12, and inquiry reply 
statements from plural respondents transmitted from the respondent 
computers 111 to UN through the network 12 are accumulated herein. 
[0017] 



The text classification engine 14 reads out plural inquiry reply statements 
from the database 13, extracts a rule for classifying the inquiry reply 
statements, and issues the rule to the claimant. The text classification 
engine 14 includes morpheme analysis means 15 for analyzing morphemes 
5 in all sentences in the inquiry reply statements accumulated in the 
database 13, category-text designating means 16 for designating the 
category and text in the text classification engine 14, attribute selecting 
means 17 for selecting attributes in plural inquiry reply statements being 
read in from the database 13, rule learning means 18 for learning the rule 
10 for expressing the correspondence of text and category on the basis of the 
words selected by attributes by the attribute selecting means 17, and rule 
output means 19 for issuing the rule. 
[0018] 

The text classification engine 14 is an engine for learning the 
15 corresponding relation of the category and text as a classification rule, and, 
for example, an engine proposed by Li and Yamanishi can be used (H. Li and 
K. Yamanishi^ Text Classification Using ESC-based Stochastic Decision 
Lists, Proceedings of 1999 International Conference on Information & 
Knowledge Management, pp. 122-130, 1999). This text classification 
20 engine basically conforms to the system of "Forming method and apparatus 
of decision list" disclosed in Japanese Patent No, 2581196. 
[0019] 

Fig. 2 shows a composition of a set of inquiry reply statements 
accumulated in the database 13. Each row expresses an inquiry item, and 
25 each line shows the reply statement of one person. 
[0020] 

Referring to Fig. 3, processing of the text classification engine 14 
comprises morpheme analysis step 31, designating step 32 of text and 
category, attribute selecting step 33, rule learning step 34, and rule output 



step 35. 
[0021] 

Referring to Fig. 4, a more specific processing of attribute selecting step 
33 includes ASCCco) computing step 41, and attribute selecting step 42. 
5 [0022] 

Referring to Fig. 5, a more specific processing of rule learning step 34 
includes data forming step 51, growth processing step 52, and clip 
processing step 53. 
[0023] 

10 Fig. 6 is a diagram showing an example of rule format knowledge 
(stochastic decision list) as a result of analysis by the text classification 
engine 14. 
[0024] 

Fig. 7 is a diagram showing other example of rule format knowledge 
15 (stochastic decision list) as a result of analysis by the text classification 
engine 14. 
[0025] 

In the inquiry reply analysis system of the first embodiment having such 
configuration, the operation is explained below, 
20 [0026] 

When inquiry respondents send inquiry reply statements from the 
respondent computers 111 to UN, the inquiry reply statements are 
communicated in the database 13 through the network 12. Suppose the 
number of respondents to be N. At this time, the inquiry reply statements 
25 may include free reply description in natural language. 
[0027] 

The text classification engine 14, first by the morpheme analysis means 
15, analyzes morphemes in all sentences of inquiry reply statements 
accumulated in the database 13 (step 31). 



[0028] 

Next, by the category-text designating means 16, the text classification 
engine 14 causes the operator to designate the category and text in the 
inquiry reply statements (step 32). Herein, designation of category is to 
5 classify by paying attention to the replies in one row. For example, it is the 
category designation that, relating to the first row in Fig. 2, the replies are 
classified into "company A", and "other than company A". The text 
designation is to designate the rows to be used in analysis except for the row 
used in category designation. For example, the text is designated by 
10 selecting the second row in Fig .2. 
[0029] 

Further, the text classification engine 14, by the attribute selecting means 
17, selects the attributes in plural inquiry reply statements being read in 
from the database 13 (step 33). The attribute selection is to select a word 
15 which is important for expressing the correspondence of text and category. 
[0030] 

Then, the text classification engine 14 learns the rule for expressing the 
correspondence of text and category on the basis of the word selected by 
attribute by the rule learning means 18 (step 34). For example, when the 
20 category and text are designated as stated above, the rule is obtained as 
shown in Fig. 6. 
[0031] 

The rule in Fig. 6 shows that if the word "easy to use" is found in the text 
by reading the first line, 92.0% of the respondents assume company A as the 
25 high-tech enterprise. If the word "easy to use" is not found, next, checking 
if the words "future" and "private" appear at the same time, and when found, 
it means that 87.2% of the respondents assume company A as the high-tech 
enterprise. Thereafter, similarly, according the rule of if-then-else pattern, 
the conditional sentences are read from top to bottom. Such rule is an easy 



and compact expression of the relation between the high-tech enterprise and 

high-tech feeling. 

[0032] 

Picking up other company B, when the category is designated into 
5 "company B" and "other than company B", the rule in Fig. 7 is obtained by 
the same procedure. 
[0033] 

Comparing the rule of company B in Fig. 7 with the rule of company A in 
Fig. 6, the high-tech feeling of the respondents assuming company A as the 
10 high-tech enterprise is mainly based on the ease of use and preference 
sensation, while the high-tech feeling of the respondents assuming company 
B as the high-tech enterprise is known to be mainly based on the efficiency. 
Thus, by comparing the rules, the inquiry replies can be analyzed easily. 
[0034] 

15 Finally, the text classification engine 14, by the rule output means 19, 
issues the knowledge of the analysis result in the rule format to the 
claimant (step 35). 
[0035] 

As an example of knowledge in rule format, herein, the stochastic decision 
20 rule is discussed, and the attribute selecting step 33 for creating it and the 
rule learning step 34 are more specifically described below, 
[0036] 

The stochastic decision list is a ranked list of stochastic rule of if-then 
pattern as shown in Fig. 6, Each stochastic rule has a pattern of "c = 1 <— t 
25 (probability p)", where c = 1 is the decision of classification, t is the condition 
of classification, and (probability p) is the probability. 
[0037] 

First, the attribute selecting step 33 is explained. 
[0038] 



The attribute selection is to collect words closely related with the category 
in the given category (for example, company A and other than company A). 
More specifically, as shown in Fig. 4, at step 41, in each word co appearing in 
the text, the difference ASC(co) between the stochastic complexity (SC) of the 
5 text set without consideration of onset of the word co and the SC with 
consideration thereof is computed, and at step 42, when the difference 
ASC(co) is greater than the given threshold x, the word co is selected as an 
attribute. 
[0039] 

10 A practical method of computing the SC is explained. Sets of texts in the 
entered inquiry reply statements are expressed as 

(dl, Cl), (d2, C2), (dm, Cm) 

where di denotes the i-th text, and is expressed as the row of words 
appearing in the i-th text, Ci denotes the value of category (label) 
15 corresponding to the i"th text, and each Ci is 1 if belonging to the given 
category (company A) or 0 otherwise (other than company A), and m is the 
number of texts. 
[0040] 

Further, label rows are expressed as c"^ = ci, Cm, and text rows are d"^ = 
20 dl, dm. The SC of label row c"^ is calculated as in formula 1. 
[0041] 
(1) 



log-— 4- log:^ 
2 2.7r 



[0042] 

25 where m"^ is the number of labels in which the value is 1 in label row c"', and 
log is the natural logarithm, which is expressed in formula 2. 
[0043] 
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(2) 

def 

Hiz) = - zloQ z -(I- z) log( 1- z) 

[0044] 

For example, as discussed by J. Rissanen and Fisher information and 
5 stochastic complexity (IEEE Trans, on Information Theory, 42 (l), 40-47, 
1996), SCCc™) is the shortest description length for describing the label row 
c™ by using the given model (herein, Bernoulli model). Suppose 
[1] 

10 is a label row composed of label Ci in which word co appears in the 
corresponding text di, where nio) is the number of labels in 
[2] 

Then/the value of SC in 
15 [3] 

can be calculated as shown in formula 3. 

[0045] 

(3) 

20 

[0046] 

where mo,"^ is the number of labels of which value is 1 in 
[4] 

^mcD 

12 
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On the other hand, suppose 
[5] 

is a label row composed of label Ci in which word o> does not appear in the 

corresponding text di, where nia, is the number of labels in 

[6] 

[0047] 

Then, the value of SC in 
[7] 

can be calculated as shown in formula 4. 

[0048] 

(4) 



15 



20 



r 



m 



+ N 



^1 w ^, 
-f-log-^+Iog;r 



[0049] 

The difference ASC(co) between the SC without consideration of 
appearance of word oo and the SC with consideration thereof is calculated as 
shown in formula 5. 
[0050] 
(5) 
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[0051] 

The word co large in the difference ASC(co) is a word appearing very 
frequently or hardly in a given category. Such words are considered to be 
closely related with the category. Supposing x to be a given threshold, the 
word CO in the relation of ASC(o)) > x is selected as an attribute. 
[0052] 

The rule learning step 34 is explained below, 
[0053] 

Suppose there are n words selected of attribute, being coi, con. At step 
51, first of all, sets of entered texts are expressed as follows. 

(dl, Cl), (d2, C2), (dm, Cm) 

[0054] 

where each di expresses a binary vector (generally, a multi-valued discrete 
vector) 

di = (coii, coi2, coin) (i = 1, m) 
where coij is 1 when the word coj obtained by attribute selection appears in 
the i-th text, or 0 otherwise (j = 1, n), Ci expresses the value (label) of the 
category corresponding to the i-th text, and each Ci is 1 when belonging to 
the specified category, and 0 otherwise, and m is the number of texts. 
[0055] 

At step 52, the rule of if-then-else pattern is selected, and sequentially 
added to the stochastic decision list A. This is called "growth." For 

14 



selection of rules, for example, the extended stochastic complexity (ESC) 

minimum principle is employed. 

[0056] 

The operation is as follows. Suppose k is a given positive integer. A set 
5 of all possible k terms (up to k link words of word co) on the basis of the word 
CO by attribute selection is supposed to be T. From terms t of the set T, 
those not appearing in the text at all are excluded. An empty stochastic 
decision list A is prepared. Next, the rule of the largest decrement of ESC 
value is sequentially added to the stochastic decision list A. 
10 [0057] 

Herein, the ESC is computer as follows. The whole data set D is 
expressed as sets of data in a format of 

(dl, Cl), (d2, C2), (dm, Cm) 

and label rows are c"^ = ci, Cm. The value of the ESC of label row Cm can 
15 be approximated as in formula 6. 
[0058] 
(6) 

ESC (c- ) = Loss ic- ) + A 4m log m 

[0059] 

20 This is one approximate format of the original ESC proposed by K. 
Yamanishi in his paper (A decision-theoretic extension of stochastic 
complexity and its applications to learning, IEEE Trans. Inform. Theory, 44, 
1424-1439, 1998), 
[0060] 

25 Herein, >^ is a positive constant, Loss (c'") is the number of errors in 
default classification. The default classification is to assume all labels are 
0, for example, t is a term in the set T. Suppose 
[0061] 



[8] 
^mt 

is a label row composed of label Ci in which term t is true in the 

corresponding text di, where mt is the number of labels in 

5 [9] 
^mt 

[0062] 

Suppose 
[10] 

10 Loss ( c™* ) 

is the number of errors when classifying by term t. On the other hand, 

[11] 

is a label row composed of label Ci in which term t is false in the 
15 corresponding text di, where m-'t is the number of labels in 
[12] 

and --t expresses negation of term t. Suppose 
[0063] 
20 [13] 

Loss (C ) 

is the number of errors when classifying by --t. The ESC values of 
[0064] 
[14] 
25 C""' 
and 
[15] 

can be calculated in formula 7 and formula 8, respectively. 
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[0065] 
(7) 

ESC (c'"' ) = Loss (c'"' ) + Z^m, log m, 

[0066] 
5 (8) 

ESC (c"'- ) = loss (c"- ) + A yjm_^^ log m_^, 

[0067] 

When classifying by term t, the decrement AESC(t) of the ESC value is 
calculated in formula 9. 
10 [068] 
(9) 

A£:^C (0 = ESC {c'" ) - {ESC (c"' ) + ESC (c'"- )) 
= [loss{c'") - Loosic'"^ ) - Loosic'"- )] 
4- [A(^m\ogm - -^m, log - ^m^^ log m^, )] 

[0069] 

According to the ESC minimum principle, term t is selected so that 
15 AESC(t) may be minimum. When such t = t* is selected, the number of 
data of whole data set D in which it is true is supposed to be 
[16] 

nit* 

and of such data, the label, for example, greater in number is supposed to be 
20 c = 1, and the number of c = 1 is supposed to be 
[17] 

nit* 

and the number of c = 0 is supposed to be 

17 




[18] 
nit* 

Accordingly, the rule "c = 1 ^ t* (probability)" is added to the stochastic 
decision list A. Herein, the probability value p is calculated, for example, 
5 as shown in formula 10 by using the method of Laplacean estimation. 
[0070] 
(10) 

im^^ + 0,5)/(w^. + I) 

[0071] 

10 Excluding term t* from the set T, a new set T is obtained, and excluding 
all data rendering term t* true from the whole data set D, a new whole data 
set D is obtained, and the same operation is repeated until the whole data 
set D becomes empty. Instead of the standard ESC used hereabove, the 
standard SC used in attribute selection may be used. 

15 [0072] 

At step 53, since the stochastic decision list A obtained at step 52 may 
excessively conform to the learning data, the rules are removed one by one 
from the last one of the stochastic decision list A consecutively until none 
should be removed from the viewpoint of the ESC minimum principle. This 
20 process is called "clipping". 
[0073] 

In this case, the manner of application of the ESC minimum principle is 
explained below. First, the value of the ESC corresponding to the 
stochastic decision list A of label row oP^ is defined as the sum of ESC values 
25 corresponding to all terms t in the stochastic decision list A as shown in 
formula 11. 
[0074] 
(11) 

18 
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ESC ic"'\A) = 2] ESC ic"' ) 

r 

[0075] 

However, 
[19] 

5 ESC( c '"^ ) 

is calculated as in formula 7. Next, the whole ESC value of label row c™ 

and stochastic decision list A are defined as in formula 12. 

[0076] 

(12) 

ESC{c"' : A) = ESCic'"\A) + A' L{A) 

= J] Loss(c'"* ) + ^m, log m, + A' Z(^) 

10 

[0077] 

where X' is a positive constant, and L(A) is a code length for encoding the 
stochastic decision list A. Specifically, it is calculated as L(A) = logT + 
log(T-l) + ... + logT (T-i+l), where T is the number of possible terms t, and i 
15 is the number of rules in the stochastic decision list A. 
[0078] 

Suppose A expresses the stochastic decision list before chpping, and A' is 
the stochastic decision list after clipping. 
ESCfc-^lA) ^ ESCCc-nlAO 
20 [0079] 

That is, as far as 

ESC (C^IA) ■ ESC (cm|A) ^ (L(A) - L(A')) 
is established, the clipping procedure continues, and when this condition is 
no longer satisfied or there is no rule left to be clipped, the stochastic 
25 decision list A obtained at this moment is delivered. Thus, the stochastic 

19 




decision list A small in the ESC on the whole is issued. 
[0080] 

In the inquiry reply analysis system of the first embodiment, rules of 
analysis results can be automatically extracted from the inquiry reply 
5 statements including free reply description in natural language collected 
through the network 12. 
[0081] 

In the inquiry reply analysis system of the first embodiment, as the text 
classification engine 14, by using the engine proposed by Li and Yamanishi 

10 (H. Li and K. Yamanishi: Text Classification Using ESC-based Stochastic 
Decision Lists, Proceedings of 1999 International Conference on Information 
& Knowledge Management, pp. 122-130, 1999), by the computation amount 
of O (n*^m), rules can be extracted from the inquiry reply statements at high 
speed, where n is the number of words of attribute selection from the 

15 inquiry reply statements, m is the number of inquiry reply statements, and 
k is the maximum number of words included in the link words relating to 
one condition. Hence, efficient automatic analysis of inquiry reply 
statements is possible. The obtained rules can express the inquiry reply 
statements belonging to a specific category in compact and easy format of if- 

20 then-else pattern. 
[0082] 

The inquiry reply analysis system of the first embodiment can be applied, 
for example, in the following business. Receiving orders for enterprise 
image survey or inquiry about specific merchandise or service from clients, 
25 the inquiry of the items as shown in Fig. 2 is operated on the network 12, 
and the inquiry reply statements including free reply description in natural 
language collected online through the network 12 are accumulated in the 
database 13, and inquiry reply statements are called therefrom, and the 
rules obtained by using the text classification engine 14 are sold to the 

20 




clients as the analysis results. 
[0083] 

(2) Second Embodiment 

Fig. 8 is a block diagram showing a configuration of an inquiry reply 
5 analysis system according to a second embodiment of the invention. The 
inquiry reply analysis system of the embodiment mainly comprises inquiry 
reply input means 81, a database 82, and a text classification engine 83. 
[0084] 

The inquiry reply input means 81 is directly connected to the database 82 
10 without connecting through network. 
[0085] 

The database 82 accumulates inquiry reply statements from plural 
inquiry respondents. 
[0086] 

15 The text classification engine 83 is exactly the same as the text 
classification engine 14 in the inquiry reply analysis system of the first 
embodiment shown in Fig. 1. Therefore, the corresponding parts are 
identified with same reference numerals, and their detailed description is 
omitted. 

20 [0087] 

The operation of the inquiry reply analysis system of the second 
embodiment having such configuration is explained below. 
[0088] 

The inquiry reply input means 81 is directly connected to the database 82 
25 without connecting through network, and receives inquiry reply statements 
including free reply description in natural language. 
[0089] 

The database 82 accumulates inquiry reply statements from plural 
inquiry respondents. 

21 



[0090] 

The text clavssification engine 83 reads out plural inquiry reply statements 
from the database 82, extracts the rules for classifying the inquiry reply 
statements, and issues the rules of analysis result to the claimant. The 
detail of the operation of the text classification engine 83 is same as that of 
the text classification engine 14 of the inquiry reply analysis system of the 
first embodiment, and the detailed description is omitted. 
[0091] 

The inquiry reply analysis system of the second embodiment can be 
applied, for example, in the following business. Receiving orders for 
enterprise image survey or inquiry about specific merchandise or service 
from clients, the inquiry of the items as shown in Fig. 2 is operated, and the 
inquiry reply statements including free reply description in natural 
language are collected at once, and accumulated in the database 82, and 
inquiry reply statements are called therefrom, and the analysis results 
obtained by using the text classification engine 83 are sold to the clients. 
[0092] 

(3) Third Embodiment 

Fig. 9 is a block diagram showing a configuration of an inquiry reply 
analysis system according to a third embodiment of the invention. The 
inquiry reply analysis system of the embodiment mainly comprises 
respondent computers 911 to 91N, a network 92, a database 93, a text 
classification engine 94, and a claimant computer 95. 
[0093] 

The respondent computers 911 to 91N are computers, portable 
information terminals, cellular phones, and other devices having 
transmission function of message, mail and the like, which are connected to 
the network 92. 
[0094] 
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The network 92 includes various networks, whether wired or wireless, 
such as public networks, exclusive networks, or LAN. 
[0095] 

The database 93 is connected to the network 92, and inquiry reply 
5 statements from plural respondents transmitted from the respondent 
computers 911 to 9 IN through the network 92 are accumulated herein. 
[0096] 

The text classification engine 94 is same as the text classification engine 
14 in the inquiry reply analysis system of the first embodiment shown in Fig. 
10 1, except that the rule output means 19 can transmit the knowledge of the 
rule format as a result of analysis through the network 92. Therefore, 
same reference numerals are given to the corresponding parts and detailed 
description is omitted. 
[0097] 

15 The claimant computer 95 requests knowledge of rule format as a result 
of analysis to the text classification engine 94 through the network 92, and 
receives the knowledge of rule format of analysis result from the text 
classification engine 94 through the network 92. 
[0098] 

20 The operation of the inquiry reply analysis system of the third 
embodiment having such configuration is explained below. 
[0099] 

The inquiry respondents send inquiry reply statements including free 
reply description in natural language from respondent computers 911 to 
25 91N through the network 92. Suppose the number of respondents to be N. 
[0100] 

The database 93 is connected to the network 92, and accumulates inquiry 
reply statements from plural inquiry respondents. 
[0101] 



The text classification engine 94 reads out plural inquiry reply statements 
from the database 93, an extracts the knowledge of rule format for 
classifying the inquiry reply statements. The text classification engine 94 
is connected to the network 92, and distributes the knowledge of rule format 
5 of analysis result through the network 92 depending on the request from the 
claimant computer 95. The detail of operation of the text classification 
engine 94 is same as that of the text classification engine 14 of the inquiry 
reply analysis system of the first embodiment, except that the knowledge of 
rule format of analysis result is distributed through the network 92, and the 
10 description of detail is omitted. 
[0102] 

The inquiry reply analysis system of the third embodiment can be applied, 
for example, in the following business. Receiving orders for enterprise 
image survey or inquiry about specific merchandise or service from clients, 

15 the inquiry of the items as shown in Fig. 2 is operated on the network 92, 
and the inquiry reply statements including free reply description in natural 
language collected online through the network 92 are accumulated in the 
database 93, and inquiry reply statements are called therefrom, and the 
analysis results obtained by using the text classification engine 94 are 

20 distributed through the network 92 to the clients when requested. 
[0103] 

(4) Fourth Embodiment 

Fig. 10 is a block diagram showing a configuration of an inquiry reply 
analysis system according to a fourth embodiment of the invention. The 
25 inquiry reply analysis system of the embodiment is similar to the inquiry 
reply analysis system of the first embodiment shown in Fig. 1, except that a 
recording medium 102 recording a text classification engine program is 
incorporated in a computer 101 connected to the database 13, and the other 
composition is same as that of the inquiry reply analysis system of the first 
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embodiment, and corresponding parts are identified with same reference 

numerals and detailed description is omitted. 

[0104] 

In the inquiry reply analysis system of the fourth embodiment having 
5 such configuration, the text classification engine program is read into the 
computer 101 from the recording medium 102, and controls the operation of 
the computer 101 as the text classification engine 14 including the 
morpheme analysis means 15, category-text designating means 16, attribute 
selecting means 17, rule learning means 18, and rule output means 19. 
10 The detail of operation of the text classification engine 14 on the computer 
101 is exactly same as in the case of the inquiry reply analysis system of the 
first embodiment, and detailed description is omitted. 
[0105] 

(5) Fifth Embodiment 

15 Fig. 11 is a block diagram showing a configuration of an inquiry reply 
analysis system according to a fifth embodiment of the invention. The 
inquiry reply analysis system of the embodiment is similar to the inquiry 
reply analysis system of the second embodiment shown in Fig. 8, except that 
a recording medium 112 recording a text classification engine program is 

20 incorporated in a computer 111 connected to the database 82, and the other 
composition is same as that of the inquiry reply analysis system of the 
second embodiment, and corresponding parts are identified with same 
reference numerals and detailed description is omitted. 
[0106] 

25 In the inquiry reply analysis system of the fifth embodiment having such 
configuration, the text classification engine program is read into the 
computer 111 from the recording medium 112, and controls the operation of 
the computer 111 as the text classification engine 83 including the 
morpheme analysis means 15, category -text designating means 16, attribute 
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selecting means 17, rule learning means 18, and rule output means 19. 
The detail of operation of the text classification engine 83 on the computer 
111 is exactly same as in the case of the inquiry reply analysis system of the 
second embodiment, and detailed description is omitted. 
5 [0107] 

(6) Sixth Embodiment 

Fig. 12 is a block diagram showing a configuration of an inquiry reply 
analysis system according to a sixth embodiment of the invention. The 
inquiry reply analysis system of the embodiment is similar to the inquiry 

10 reply analysis system of the third embodiment shown in Fig. 9, except that a 
recording medium 122 recording a text classification engine program is 
incorporated in a computer 121 connected to the database 93, and the other 
composition is same as that of the inquiry reply analysis system of the third 
embodiment, and corresponding parts are identified with same reference 

15 numerals and detailed description is omitted. 
[0108] 

In the inquiry reply analysis system of the sixth embodiment having such 
configuration, the text classification engine program is read into the 
computer 121 fi:om the recording medium 122, and controls the operation of 

20 the computer 121 as the text classification engine 94 including the 
morpheme analysis means 15, categorytext designating means 16, attribute 
selecting means 17, rule learning means 18, and rule output means 19. 
The detail of operation of the text classification engine 94 on the computer 
121 is exactly same as in the case of the inquiry reply analysis system of the 

25 third embodiment, and detailed description is omitted. 
[0109] 

[Effects of the Invention] 
[0110] 

According to a first effect of the invention, by receiving orders for 

26 



enterprise image survey or inquiry about specific merchandise or service 
from clients, the inquiry is operated on the network, and the inquiry reply 
statements including free reply description in natural language collected 
online through the network are accumulated in the database, and inquiry 
5 reply statements are called therefrom, and the rules obtained by using the 
text classification engine are sold to the clients as the analysis results. 
[0111] 

According to a second effect of the invention, by receiving orders for 
enterprise image survey or inquiry about specific merchandise or service 

10 from clients, the inquiry is operated, and the inquiry reply statements 
including free reply description in natural language are collected at once, 
and accumulated in the database, and inquiry reply statements are called 
therefrom, and the analysis results obtained by using the text classification 
engine are sold to the clients. 

15 [0112] 

According to a third effect of the invention, by receiving orders for 
enterprise image survey or inquiry about specific merchandise or service 
from clients, the inquiry is operated on the network, and the inquiry reply 
statements including free reply description in natural language collected 

20 online through the network are accumulated in the database, and inquiry 
reply statements are called therefrom, and the analysis results obtained by 
using the text classification engine are distributed through the network to 
the clients when requested. 
[Brief Description of the Drawings] 

25 [Fig. 1] 

It is a block diagram showing a configuration of an inquiry reply analysis 
system according to a first embodiment of the present invention. 
[Fig. 2] 

It is a diagram showing an example of inquiry reply statements 
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accumulated in a database in Fig. 1. 
[Fig. 3] 

It is a flowchart showing processing in a text classification engine in Fig. 

1. 

5 [Fig. 4] 

It is a flowchart showing a more specific processing of attribute selecting 
step in Fig. 3. 
[Fig. 5] 

It is a flowchart showing a more specific processing of rule learning step 
10 in Fig. 3. 
[Fig. 6] 

It is a diagram showing an example of rule format knowledge (stochastic 
decision list) as a result of analysis by the text classification engine in Fig. 1. 
[Fig. 7] 

15 It is a diagram showing other example of rule format knowledge 
(stochastic decision list) as a result of analysis by the text classification 
engine in Fig. 1. 
[Fig. 8] 

It is a block diagram showing a configuration of an inquiry reply analysis 
20 system according to a second embodiment of the present invention. 
[Fig. 9] 

It is a block diagram showing a configuration of an inquiry reply analysis 
system according to a third embodiment of the present invention. 
[Fig. 10] 

25 It is a block diagram showing a configuration of an inquiry reply analysis 
system according to a fourth embodiment of the present invention. 
[Fig. 11] 

It is a block diagram showing a configuration of an inquiry reply analysis 
system according to a fifth embodiment of the present invention. 
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[Fig. 12] 

It is a block diagram showing a configuration of an inquiry reply analysis 
system according to a sixth embodiment of the invention. 
[Explanation of Reference Numerals] 
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13 DATABASE 

14 TEXT CLASSIFICATION ENGINE 
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[Name of the Document] Abstract 
[Abstract] 

[Problem] To analyze inquiry replies automatically from inquiry reply 
statements including free reply description in natural language collected 
through a network, by using a text classification engine, and to issue the 
analysis results of the claimant as the knowledge of rule formats. 
[Solving Means] Inquiry respondents send inquiry reply statements from 
respondent computers 111 to UN. Inquiry reply statements are 
accumulated in a database 13 through a network 12. A text classifi.cation 
engine 14 reads out accumulated inquiry reply statements from the 
database 13, and learns the rules for classifying the inquiry reply 
statements, and issues to the claimant. 
[Selected Drawing] Fig. 1 
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